Reinforcement learning

Results: 1147



511 Mind / Educational psychology / Developmental psychology / Reinforcement / Psychology / Learning / Motivation / E-learning / Writing Across the Curriculum / Behavior / Education / Behaviorism

Novel Writing Assignments in the Psychology of Learning John Kulig

Source URL: wac.colostate.edu

Language: English - Date: 2002-06-05 19:49:43
512 International Conference on Machine Learning / Conference on Neural Information Processing Systems / Reinforcement learning / Partially observable Markov decision process / Institute of Electrical and Electronics Engineers / Artificial intelligence / Machine learning / Statistics

Laurent Charlin – Curriculum Vitae 424 Rue St-Zotique Est Montreal, QC H2S 1L9 +

Source URL: www.cs.toronto.edu

Language: English - Date: 2015-03-01 20:46:17
513 Economics / Autoregressive conditional heteroskedasticity / Stochastic volatility / Asian option / TVR / Reinforcement learning / Normal distribution / LSm / Economic model / Options / Financial economics / Statistics

Policy Iteration for Learning an Exercise Policy for American Options Yuxi Li, Dale Schuurmans Department of Computing Science, University of Alberta Abstract. Options are important financial instruments, whose prices

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2008-09-08 12:22:02
514 Markov models / Computer vision / Robotics / ICub / Humanoid robot / Pi / Reinforcement learning / Robot / Algorithm / Mathematical analysis / Mathematics / Science and technology in Europe

Learning to Fire at Targets by an iCub Humanoid Robot Vishnu K. Nath and Stephen E. Levinson University of Illinois at Urbana-Champaign 405 North Mathews Avenue Urbana, IL 61801

Source URL: www.isle.illinois.edu

Language: English - Date: 2013-01-07 15:46:56
515 Learning / Neural networks / Reinforcement learning / Supervised learning / Temporal difference learning / Backgammon / E-learning / Algorithm / Backpropagation / Machine learning / Computational neuroscience / Games

Practical Issues in Temporal Difference Learning Gerald Tesauro IBM Thomas J. Watson Research Center PO Box 704, Yorktown Heights, NY USA Abstract. This paper examines whether temporal difference methods for t

Source URL: aass.oru.se

Language: English - Date: 2005-06-14 12:26:47
516 Systems theory / Mathematical optimization / Operations research / Equations / Stochastic control / Reinforcement learning / Markov decision process / Bellman equation / Policy / Statistics / Control theory / Dynamic programming

Journal of Artificial Intelligence Research Submitted 3/13; published A Survey of Multi-Objective Sequential Decision-Making Diederik M. Roijers

Source URL: www.jair.org

Language: English - Date: 2013-10-18 15:20:49
517 Stochastic control / Reinforcement learning / Markov decision process / Valuation / Policy / Statistics / Dynamic programming / Markov processes

Journal of Artificial Intelligence Research Submitted 01/05; published 1/06 Approximate Policy Iteration with a Policy Language Bias: Solving Relational Markov Decision Processes

Source URL: www.jair.org

Language: English - Date: 2009-08-06 19:20:12
518 Stochastic control / Q-learning / Reinforcement learning / Markov decision process / Machine learning / SARSA / Statistics / Dynamic programming / Markov processes

13. Reinforcement Learning [Read Chapter 13] [Exercises 13.1, 13.2, 13.4] Control learning; control policies that choose optimal actions; Q learning

Source URL: aass.oru.se

Language: English - Date: 2005-03-31 12:57:45
519 Convex optimization / Mathematical optimization / Number theory / Topological groups / Markov decision process / Linear programming / Reinforcement learning / Representation theory / Μ operator / Mathematics / Algebra / Operations research

Stable Dual Dynamic Programming Tao Wang Daniel Lizotte Michael Bowling Dale Schuurmans Department of Computing Science

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2007-10-21 19:53:48
520 Dynamic programming / Systems theory / Equations / Operations research / Stochastic control / Markov decision process / Reinforcement learning / Bellman equation / Automated planning and scheduling / Statistics / Mathematical optimization / Control theory

Reverse Iterative Deepening for Finite-Horizon MDPs with Large Branching Factors Andrey Kolobov Peng Dai Mausam Daniel S. Weld

Source URL: www.cs.washington.edu

Language: English - Date: 2013-08-13 03:53:46